Abstract Allometric growth is found in many online tagging systems. That
is, the number of new tags ($T$) is a power-law function of the active population
($P$), or $T \sim P^{\gamma}$ ($\gamma > 1$). According to previous studies, it is the
heterogeneity in individual tagging behavior that gives rise to allometric growth.
These studies consider a power-law distribution model with an exponent $\beta$,
regarding $1/\beta$ as an index of heterogeneity. However, they did not discuss
whether the power law is the only distribution that leads to allometric growth,
or whether the positive correlation between heterogeneity and
allometric growth holds in systems of distributions other than power-law. In
this paper, the authors systematically examine the growth pattern of systems
of six different distributions, and find that both the power-law distribution
and the log-normal distribution lead to allometric growth. Furthermore, by
introducing Shannon entropy as an indicator of heterogeneity instead of $1/\beta$,
the authors confirm that the positive relationship between heterogeneity and
allometric growth exists in both the power-law and log-normal cases.
1 Introduction

Studies on human collaborative tagging behavior have discovered the allometric
growth [1][2][3][5] of online tagging systems, that is, the power-law
relationship $T \sim P^{\gamma}$ ($\gamma > 1$) between the number of new tags ($T$) and the
active population ($P$). Previous studies tend to adopt a power-law distribution
model (with an exponent $\beta$) to explain the allometric growth [1][2][3][5].
Although these models may be different from each other in details, they 
agree on the assumption that heterogeneity in individual tagging behavior 
leads to the allometric growth. Moreover, [1][5] suggest that if we regard
the reciprocal of $\beta$ as an indicator of heterogeneity, a positive correlation
between heterogeneity and the allometric growth rate $\gamma$ can be found.
However, these studies do not discuss whether the power-law distribution is
the only distribution that gives rise to allometric growth, nor do they
consider whether the positive correlation between heterogeneity and allometric
growth also exists in systems of distributions other than power-law. Without
answering these questions, it is hasty to conclude that heterogeneity
leads to allometric growth. In the current study, we find that both the
power-law distribution and the log-normal distribution lead to allometric growth.
Further, we suggest that if we regard rescaled Shannon entropy as a more
general indicator of heterogeneity than $1/\beta$, there is supporting evidence for
the positive relationship between heterogeneity and allometric growth in
systems of both power-law and log-normal distributions. As allometric growth has
been widely discovered in many different social systems online [4][6][7][8]
and offline [9][10][11][12], our findings on the relationship between Shannon
entropy and allometric growth may be useful in studying these systems.



2 The allometric growth of social annotation and the explanatory 
model of power law distribution 

Fig. 1 shows examples of allometric growth. To explain the growth pattern,
[1] proposes that such a pattern can be observed if the empirical daily
distribution of individual tagging activities follows a power-law distribution
that is invariant to system size (population). The power-law distribution
model can be expressed as

$$n(t) = (t/t_{\max})^{-\beta} \tag{1}$$

where $t$ is the number of tags, with a maximum value $t_{\max}$, and $n(t)$ is the
number of users who generate $t$ tags. [1] suggests that the positive
relationship between $1/\beta$ and $\gamma$ can be analytically derived as

$$\gamma \sim \begin{cases} 2/\beta & \text{if } 1 < \beta < 2 \\ 1 & \text{if } \beta \geq 2 \end{cases} \tag{2}$$

Eq. 2 is confirmed by empirical data and numerical simulations in [1].
Therefore, a power-law distribution with an exponent smaller than 2 gives rise
to allometric growth. In other words, accelerating growth usually appears in
systems that are heterogeneous in resources or activities. In [5] and [3], the
analytical relationships between $\beta$ and $\gamma$ differ slightly from Eq. 2 but
likewise support the positive relationship between heterogeneity
and allometric growth.
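
The piecewise relationship in Eq. 2 can be written as a small function. This is only an illustrative sketch: the name `growth_exponent` is hypothetical, and assigning the boundary value $\beta = 2$ to the linear branch is an assumption (both branches give 1 there).

```python
def growth_exponent(beta: float) -> float:
    """Scaling exponent gamma predicted by Eq. 2 for a power-law exponent beta."""
    if beta <= 1:
        # Eq. 2 is only stated for beta > 1.
        raise ValueError("Eq. 2 is stated only for beta > 1")
    if beta < 2:
        # Heterogeneous regime: gamma = 2 / beta > 1 (allometric growth).
        return 2.0 / beta
    # beta >= 2 (boundary placement is an assumption): linear growth.
    return 1.0


for beta in (1.2, 1.5, 2.0, 3.0):
    print(beta, growth_exponent(beta))
```

Smaller values of $\beta$ (larger $1/\beta$, hence more heterogeneity) give larger values of $\gamma$, which is the positive relationship discussed above.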
Fig. 1 Accelerating growth in two online tagging systems (Flickr and Delicious).
Different colors indicate different types of systems. Each point is plotted by counting
the number of new tags (y) and the population of active users (x) in a system in one
day. The scaling exponent $\gamma$, estimated by ordinary least squares (OLS) regression
on the log-log plot, is drawn as a dark line to guide the eye. The figure is taken
from [1].



3 Both power-law distributions and log-normal distributions lead 
to allometric growth 

Is the power law the only distribution leading to allometric growth? Previous
studies [1][2][3][5] did not consider this question. To answer it,
we numerically simulate systems of six types of distributions and observe
the growth rate ($\gamma$) of these systems. The details of the simulations
are listed in Table 1. It turns out that both the Pareto distribution with
an exponent $\alpha$ between 0 and 1 and the log-normal distribution give rise to
allometric growth (i.e., $\gamma > 1$). The observation that the Pareto distribution leads
to accelerating growth when its exponent $\alpha$ ranges from 0 to 1 is consistent
with Eq. 2, since the Pareto distribution (with an exponent $\alpha$) can be viewed
as the cumulative distribution function of the power-law distribution (with an
exponent $\beta$), and $\alpha = \beta - 1$, so that $0 < \alpha < 1$ corresponds to $1 < \beta < 2$.
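
The kind of numerical experiment described above can be sketched as follows. This is a minimal illustration, not the authors' simulation code: it assumes that the number of new tags $T$ on a day is the sum of $P$ independent draws from the chosen distribution. The function names (`ols_slope`, `estimate_gamma`), the population range, the seed, and the parameter values are all hypothetical choices.

```python
import math
import random


def ols_slope(xs, ys):
    """Slope of the ordinary least squares line through the points (xs, ys)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    return sxy / sxx


def estimate_gamma(draw, populations):
    """Fit gamma in T ~ P^gamma, taking T as the sum of P i.i.d. draws (assumption)."""
    log_p, log_t = [], []
    for p in populations:
        total = sum(draw() for _ in range(p))  # tags produced by p users in one "day"
        log_p.append(math.log(p))
        log_t.append(math.log(total))
    return ols_slope(log_p, log_t)


random.seed(0)  # fixed seed so the run is repeatable
# 40 population sizes spaced evenly on a log scale between 50 and 20000
populations = [int(round(50 * 400 ** (i / 39))) for i in range(40)]

gamma_normal = estimate_gamma(lambda: random.gauss(5.0, 1.0), populations)
gamma_pareto = estimate_gamma(lambda: random.paretovariate(0.5), populations)
print(f"normal: {gamma_normal:.2f}, Pareto(alpha=0.5): {gamma_pareto:.2f}")
```

Under these assumptions, a thin-tailed distribution should give a fitted exponent close to 1, while a Pareto distribution with $\alpha < 1$ should give a value clearly above 1. Heavy-tailed runs are noisy, so the exact estimate varies with the seed.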

However, Eq. 2 obviously cannot be used to explain the allometric
growth in the case of the log-normal distribution. Does the relationship between
heterogeneity and accelerating growth also exist in systems of non-power-law
distributions (e.g., the log-normal distribution)? To answer this question, we
need an indicator of heterogeneity other than $1/\beta$. In the current
research, we recommend using Shannon entropy [13] as the new indicator.

Table 1 Parameter ranges and growth rate ($\gamma$) in systems of different distributions

                  Normal               Weibull              Poisson              Gamma                Log-normal           Pareto-1             Pareto-2
Parameters        $1 < \mu < 10$       $1 < \alpha < 10$    $0.1 < \mu < 10$     $1 < \alpha < 10$    $1 < \mu < 10$       $1 < k < 10$         $1 < k < 10$
                  $0.1 < \sigma < 10$  $0.1 < \beta < 10$                        0.1 < < 10           $0.1 < \sigma < 10$  $0.1 < \alpha < 1$   $1 < \alpha < 10$
Mean of $\gamma$  1.00                 1.00                 1.00                 1.00                 1.60                 3.02                 1.00
S.D. of $\gamma$  0.01                 < 0.01               0.01                 < 0.01               0.92                 2.94                 0.03
N of simulations  400                  400                  40                   400                  400                  200                  200



4 Rescaled Shannon entropy as an indicator for heterogeneity 

As mentioned above, we introduce the Shannon entropy $H$ as a new indicator
of heterogeneity because the indicator $1/\beta$ does not apply to the log-normal
distribution. To test the assumption that heterogeneity leads to allometric
growth, we examine the relationship between $H$ and $\gamma$ for both the
power-law distribution and the log-normal distribution.

According to [13], the value of entropy $H_1$ in the power-law distribution
$f(x) = C^{\beta-1} x^{-\beta} (\beta - 1)$ can be calculated as

$$H_1 = k_1 \log\left(\frac{C}{\beta-1}\right) + k_2 \frac{1}{\beta-1} + k_3 \tag{3}$$

where $k_1$, $k_2$, and $k_3$ are constant coefficients to be estimated. As $H_1$
in Eq. 3 depends on system size, to facilitate comparisons
between the entropies of systems of different size $N$, given $H_{\max} = N\log(N)$, we
calculate the rescaled value of $H_1$ as $H = H_1/(N\log(N))$. In Fig. 2 we show
how $H$ increases with $C$ and decreases with $\beta$ in numerical simulations.
Since Eq. 2 gives $\gamma > 1$ only for $\beta < 2$, and $H$ decreases with $\beta$, the entropy
at $\beta = 2$ acts as a threshold for allometric growth. By
fitting the curves in Fig. 2 with Eq. 3, we calculate the minimum value of
this threshold, $H_t = 0.586$, which is achieved when $C = 1$ and $\beta = 2$. In other
words, a system with a power-law distribution will not show allometric growth
unless its entropy is greater than $H_t = 0.586$.
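
As intuition for why $H$ rises with $C$ and falls with $\beta$, the differential entropy of the power-law density can be worked out in closed form. The sketch below assumes the support $x \geq C$ (which is what makes $f$ integrate to one) and natural logarithms. It is a standard calculation for this density, not a derivation of the fitted coefficients $k_1$, $k_2$, $k_3$ in Eq. 3.

```latex
\begin{align*}
f(x) &= (\beta-1)\,C^{\beta-1}x^{-\beta}, \qquad x \ge C,\ \beta > 1 \\
\mathbb{E}[\ln x] &= \int_C^{\infty} \ln x \, f(x)\,dx = \ln C + \frac{1}{\beta-1} \\
h(f) &= -\mathbb{E}[\ln f(x)]
      = -\ln(\beta-1) - (\beta-1)\ln C + \beta\,\mathbb{E}[\ln x] \\
     &= \ln\frac{C}{\beta-1} + \frac{\beta}{\beta-1} \\
\frac{\partial h}{\partial C} &= \frac{1}{C} > 0, \qquad
\frac{\partial h}{\partial \beta} = -\frac{1}{\beta-1} - \frac{1}{(\beta-1)^2} < 0
\end{align*}
```

The signs of the two partial derivatives match the behavior reported for the simulations: entropy grows with $C$ and shrinks as $\beta$ increases.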

In Fig. 3 we show the relationship between $H$ and $\gamma$ in simulated systems
of power-law and log-normal distributions. It turns out that the
threshold value of entropy $H_t = 0.586$ applies in both cases. We also plot
the minimum entropy of two social tagging systems, Delicious and Flickr. In
calculating their entropies, we set $C = 1$ and substitute the empirical values
of $\beta$ (Fig. 1) into Eq. 3. We find that the behavior of the two empirical systems is
consistent with our findings. According to the results shown in Fig. 3, the
positive relationship between the rescaled Shannon entropy $H$ and the allometric
growth rate $\gamma$ is supported by both empirical data and simulation.



Fig. 2 The relation between Shannon entropy ($H$) and growth rate ($\gamma$). The results
of numerical simulations (with parameters set as $1 < \beta < 10$ and $1 < C < 10$ in the
power-law model $f(x) = C^{\beta-1} x^{-\beta} (\beta - 1)$) are plotted as dots whose color varies
with $C$. The dots are fitted by Eq. 3 with OLS regression. The estimated
curves are shown as purple lines. The large green points are the thresholds,
which are reached when $\beta = 2$ and $C$ varies from 1 to 10. The inset shows how the
value of the threshold increases with the value of $C$. The minimum value, $H_t = 0.586$,
is achieved when $C = 1$ and $\beta = 2$.
Fig. 3 The relationship between rescaled heterogeneity ($H$) and growth rate ($\gamma$).
Both the Pareto distribution with an exponent ranging from 0 to 1 (Pareto-1) and the
log-normal distribution lead to accelerating growth (i.e., $\gamma > 1$). Moreover, in both
cases, accelerating growth does not appear until the entropy threshold
$H_t = 0.586$ is reached.
5 Discussion

Previous studies uncover the allometric growth pattern of social tagging
systems and explain the pattern with a power-law distribution model [1][3][5].
This paper systematically investigates the growth pattern of
systems of different distributions and finds that the log-normal distribution also
leads to allometric growth. With regard to the principle derived from the
power-law distribution model that heterogeneity leads to allometric growth
[1][5], the authors propose that if rescaled Shannon entropy is used to
represent the heterogeneity of individual tagging behavior, the principle holds
for both the power-law distribution and the log-normal distribution.

The findings of this paper may find applications in the
information industry. For instance, the positive relationship between
heterogeneity and allometric growth suggests that, to enjoy fast growth of users,
webmasters should keep the entry threshold of their websites as low as possible to
attract users who are as diverse as possible.

Some open questions point to directions for future research. Why do
systems of power-law and log-normal distributions, but not other
distributions, show allometric growth? Can the relationship
between heterogeneity and allometric growth be generalized to other online
or even offline social systems? These important questions call for further
exploration.